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ABSTRACT 

Objectives: To further identify the single-nucleotide 
polymorphisms (SNPs) that contribute to the genetic 
susceptibility to sarcoidosis, we examined the potential 
association between sarcoidosis and 15 SNPs of the 
ANXA11 gene. 

Design: A case-control study. 

Setting: A tuberculosis unit in a hospital of the 

university in China. 

Participants: Participants included 412 patients with 
sarcoidosis and 418 healthy controls. 
Methods: The selected SNPs were genotyped using 
the MALDI-TOF in the MassARRAY system. 
Results: Statistically significant differences were found 
in the allelic or genotypic frequencies of the 
rs2789679, rs1 049550 and rs2819941 in the ANXA11 
gene between patients with sarcoidosis and controls. 
The rs2789679 A allele (p=0.00004, 0R=1.42, 95% CI 
1.17 to 1.73) and rs2819941 T allele (p=0.0006, 
0R=1.41, 95% C1 1.16 to 1.71) were significantly more 
frequent in patients with sarcoidosis compared with 
controls. The frequency of the rs1 049550 T allele 
(p=0.000002, OR=0.61, 95% CI 0.49 to 0.74) in 
patients with sarcoidosis was significantly lower than 
that in controls. The multi-SNP model reveals that 
rs1 049550 is the only independent SNP association 
effect after accounting for the other two marginally 
associated SNPs. In block 2 (rs1049550-rs2573351), 
the T-C haplotype occurred significantly less frequently 
(p=0.001), whereas the C-C haplotypes occurred more 
frequently (p=0.0001) in patients with sarcoidosis than 
controls. Furthermore, genotype frequency distribution 
revealed that, in rs1 049550, the CC genotype was 
significantly more in patients with chest X-ray (CXR) 
stage I sarcoidosis than in patients with CXR stage 
ll-IV sarcoidosis (p=0.012). 
Conclusions: These findings point to a role for the 
polymorphisms of ANXA11 in sarcoidosis in a Chinese 
Han population, and may be informative for future 
genetic studies on sarcoidosis. 
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INTRODUCTION 

Sarcoidosis is a systemic autoimmune disease 
characterised by destructive, non-caseating 



A systematical screening of the functional single- 
nucleotide polymorphisms in the promoter 
region, 5'-UTR (untranslated region) and 3'-UTR, 
exons of the ANXA11 gene, and the homogeneity 
of study participants representing the Chinese 
Han are the main strengths of this study. 
The lack of data proving the positive association 
observed for rs2789679 and rs1 049550 is a 
potential limitation of this study. Furthermore, 
the association of the serum level of ANXA11 
with sarcoidosis still needs to be investigated. 
Furthermore, the deficient study that investigates 
the association of the protein level of ANXA11 
with sarcoidosis is a potential limitation of this 
study. 



epithelioid granulomatous lesions, with accu- 
mulated phagocytes and activated CD 4 T 
helper type 1 lymphocytes. 1 2 The typical 
manifestations of sarcoidosis include bilateral 
hilar lymphadenopathy, pulmonary infiltra- 
tion and ocular and skin lesions. The clinical 
course and prognosis of sarcoidosis are vari- 
able. In most cases, the acute type of sarcoid- 
osis such as Lofgren's syndrome can resolve 
spontaneously. However, chronic pulmonary 
sarcoidosis may progress to lung fibrosis, 
which eventually causes respiratory failure. 
Recent studies have demonstrated that sar- 
coidosis can occur in a pattern of family clus- 
tering, and its racial incidence is also 
different, suggesting that some genetic 
factors may contribute to the risk and sever- 
ity of sarcoidosis. 1 3 

The Annexin gene family is involved in the 
aetiology of several autoimmune and chronic 
diseases. 4 5 One member of the Annexin 
gene family, ANXAll, located on chromo- 
some 10q22.3, is involved in calcium signal- 
ling, apoptosis, vesicle trafficking, cell growth 
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and the terminal phase of cell division. 5-7 In this 
context, the development and maintenance of the 
granulomatous inflammation in sarcoidosis have been 
repeatedly associated with the impaired apoptosis of acti- 
vated inflammatory cells. 8 9 Most of the early-stage sar- 
coidosis is characterised by mononuclear cell alveolitis 
dominated by activated CD4 T cells and macrophages. 
Between these cells, the uncoordinated interplay results 
in the formation of typical non-caseating granulomas. 
The mechanism by which the granulomas resolve has 
not been fully elucidated. However, it is generally 
assumed that apoptosis and the withdrawal of inflamma- 
tory cytokines are involved in the disappearance of 
granulomas. 10 11 In patients with sarcoidosis, their per- 
ipheral blood monocytes were characterised by 
apoptosis. 9 

In previous studies, case-control/family 'hypothesis- 
driven' studies and low density linkage scans were used 
to identify genetic factors conferring the genetic suscep- 
tibility to sarcoidosis. 12 Notably, the first genome-wide 
association study (GW\S) in sarcoidosis, conducted in a 
German population, has recently revealed an associ- 
ation: the leading rs2789679 SNP (single-nucleotide 
polymorphism (SNP)) located in the 3'-UTR (untrans- 
lated region) and a common non-synonymous SNP 
(rsl049550, C>T, Arg230Cys) were associated with the 
increased risk of sarcoidosis. 4 13 This association has 
recently been supported by another report from the 
same population. 14 Thus, more studies should be per- 
formed to demonstrate the following items: whether 
these SNPs modulate the risk of sarcoidosis by them- 
selves or whether they correlate with other causative 
SNPs and are repeated in other populations. The pub- 
lished studies about the association of sarcoidosis and 
ANXA11 are summarised in online supplementary table 
SI. We hypothesise that common variants in the 
ANXA1 1 gene may significantly contribute to the predis- 
position to develop sarcoidosis. 

In this study, we investigated 15 loci in a Chinese 
population from the He 'nan province (China) to verify 
the putative association between ANXA11 polymorph- 
isms and sarcoidosis. 



SUBJECTS AND METHODS 
Subjects 

Four hundred and twelve patients with sarcoidosis 
(mean±SD age: 53. 6±4.6 years; 155 men and 257 
women) were recruited from our hospital between May 
2005 and March 2013. Sarcoidosis was diagnosed by the 
evidence of non-caseating epithelioid cell granuloma in 
biopsy specimens and chest X-ray (CXR) abnormalities. 
Chronic sarcoidosis was defined as a disease over at least 
2 years or at least two episodes in a lifetime. Acute sar- 
coidosis was defined as one episode of acute sarcoidosis 
which had totally resolved at the date of the examin- 
ation. The control group consisted of 418 unrelated 
healthy participants (mean±SD age: 54.2±5.3 years; 160 



men and 258 women) who underwent health examina- 
tions in the Medical Examination Center of the First 
Affiliated Hospital of the Xinxiang Medical College 
(Xinxiang, China) between October 2009 and 
September 2013. None of the individuals in the control 
group had a history of lung diseases or showed any 
symptoms of the lung or other diseases by CXR or 
laboratory blood tests. Participants were excluded if 
they: were taking other prescribed medications that 
could affect the central nervous system; had a history of 
seizures, hematological diseases or severe liver or kidney 
impairment; or were pregnant. No familial relationship 
was known between the study participants. 

SNP selection 

Tagging SNPs were selected from the catalogues of the 
International HapMap Project. The rs2789679 and 
rs2245168 are located in 5'-UTR, rs2236558 in intron 6, 
rs7067644 and rsl2763624 in intron 8, rsl049550 in 
exon 10, rs2573351 in intron 13, rsl0887581, 
rsl 1201989, rs2573353, rs2789695, rs2573356, rs2819945, 
rsll202059 and rs2819941 in intron 14 (see online sup- 
plementary figure SI). Marker selection was based on 
the following criteria. (A) We used the CHB data from 
the HapMap (release 27) to select tagSNPs for ANXA11. 
(B) We restricted our search for tagSNPs from base pair 
80155124 to base pair 80205677. (C) Further, we limited 
SNPs to tag to those with a minor allele frequency >0.2 
in the CHB. (D) Based on these restrictions, there were 
a total of 29 SNPs. (e) Using HAPLOVIEW with an r 2 
threshold >0.8, 15 tagSNP were selected and used for 
subsequent analyses. 

Genotyping 

Peripheral blood was collected from a vein into a sterile 
tube coated with EDTA. Plasma samples were stored at 
— 20°C. Genomic DNA was extracted from the frozen 
peripheral blood samples using a QIAmp Blood Mini 
Kit (Qiagen Inc., Valencia, California, USA) according 
to the manufacturer's protocols. The selected SNPs were 
genotyped in cases and controls by using the 
MALDI-TOF in the MassARRAY system (Sequenom Inc., 
San Diego, California, USA). Probes and primers were 
designed using the Assay Design Software (Sequenom, 
San Diego, California, USA). The completed genotyping 
reactions were spotted onto a 384 well spectroCHIP 
(Sequenom) using the MassARRAY Nanodispenser 
(Sequenom) and determined by the matrix-assisted laser 
desorption ionisation time-of-flight mass spectrometer. 
Genotype calling was performed in real time with the 
MassARRAY RT software V.3.0.0.4 and analysed using the 
MassARRAY Typer software V.3.4 (Sequenom). 

Statistical analysis 

All data were analysed using the SPSS V.17.0 software 
(SPSS Inc., Chicago, Illinois, USA). The Hardy- 
Weinberg equilibrium was evaluated by % 2 tests. 
Differences between cases and controls in the frequency 
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of the alleles, genotypes and haplotypes were evaluated 
by Fisher's exact test or the Pearson % 2 test. 
Unconditional logistic regression was used to calculate 
the OR and 95% CI in independent association between 
each locus and the presence of sarcoidosis. Gender and 
age of participants were treated as covariants in binary 
logistic regression, p Values were calculated based on 
codominant, dominant for the rare allele, heterosis and 
recessive for the rare allele models of inheritance. 
Models of multiple logistic regression were used to test 
the independence of individual allelic effect. In detail, 
the most significant SNP was chosen to be the conditional 
SNP (covariate in the regression model) when testing 
other significant SNPs; Meanwhile, a multi-SNP model 
including all significant SNPs was also performed. The 
Bonferroni correction was used to adjust the test level 
when multiple comparisons were conducted, and the p 
value was divided by the total number of loci. Haplotype 
blocks were defined according to the criteria developed 
by Gabriel et al 15 Pairwise LD statistics (D f and r 2 ) and 
haplotype frequency were calculated, and haplotype 
blocks were constructed using the Haploview 4.0. 16 To 
ensure that the LD blocks most closely reflect the popula- 
tion level LD patterns, the definition of blocks was based 
on the control samples alone. The haplotype frequencies 
were estimated using GENECOUNTING, which 



computes maximum-likelihood estimates of haplotype 
frequencies from unknown phase data by utilising an 
expectation-maximisation algorithm. 17-20 The signifi- 
cance of any haplotypic association was evaluated using a 
likelihood ratio test, followed by permutation testing that 
compared estimated haplotype frequencies in cases and 
controls. 17 19 



RESULTS 

The genotype distribution of the 15 polymorphisms was 
consistent with the Hardy-Weinberg equilibrium 
(p>0.05). The analysis of strong LD in patients with sar- 
coidosis and healthy controls revealed that 2 SNPs 
(rs2245168, rs2236558), 2 SNPs (rsl049550, rs2573351), 
2 SNPs (rs2789695, rs2573356) and 3 SNPs (rs2819945, 
rsl 1202059, rs2819941) were located in haplotype 
block 1, block 2, block 3 and block 4 (D'>0.9, figure 1). 
The genotype distribution, allelic frequencies and haplo- 
types in patients with sarcoidosis and healthy controls are 
showed in tables 1-3 and online supplementary table S2. 

A comparison of genotype and allele frequency distribu- 
tion revealed significant differences between patients with 
sarcoidosis and healthy controls for 3 SNPs: rs2789679, 
rsl049550 and rs2819941. The frequency of the rsl049550 
T allele (p=0.000002, OR=0.61, 95% CI 0.49 to 0.74) in 
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Figure 1 The LD plot of the 15 SNPs in the ANXA1 1 gene. Values in squares are the pairwise calculation of r 2 (left) or D' 
(right). Black squares indicate r 2 =1 (ie, perfect LD between a pair of (single-nucleotide polymorphisms SNPs)). Empty squares 
indicate D'=1 (ie, complete LD between a pair of SNPs). 
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Table 1 Genotypic and allelic frequencies of ANXA1 1 polymorphisms in controls and patients with sarcoidosis 




Controls (n=41 8) Sarcoidosis (n=41 2) 



\/o ki o k\ Iq 
VdridUK? 


No 


Par ront 
rCI Ut?IH 




Dor /»ont 


n Weill io* 


OR QR°/» PI 
Un, 5JO /o vsi 


rs2789679 










0.0007 




TT 


114 


27.3 


94 


22.8 


0.140 


1.27, 0.93 to 1.74 


AT 


225 


53.8 


186 


45.2 


0.012 


1.42, 1.08 to 1.87 


AA 


79 


18.9 


132 


32.0 


0.0002 


0.49, 0.36 to 0.68 


Per A allele 


383 


45.8 


450 


54.6 


0.00004 


1.42, 1.17 to 1.73 


rs 1049550 










0.0002 




CC 


154 


36.8 


208 


50.5 


0.0007 


0.57, 0.43 to 0.75 


CT 


190 


45.5 


170 


41.3 


0.221 


1.19, 0.90 to 1.56 


TT 


74 


17.7 


34 


8.3 


0.0008 


2.39, 1 .55 to 3.68 


Per T allele 


338 


40.4 


238 


28.8 


0.000002 


0.61,0.49 to 0.74 


rs2819941 










0.0002 




CC 


114 


27.3 


94 


22.8 


0.141 


1.27, 0.93 to 1.74 


CT 


226 


54.1 


190 


46.1 


0.021 


1.38, 1.05 to 1.81 


TT 


78 


18.7 


128 


31.1 


0.0004 


0.51,0.37 to 0.70 


Per T allele 


382 


45.7 


446 


54.1 


0.0006 


1.41, 1.16 to 1.71 



*p Value was calculated by 2x3 and 2x2 % 2 tests based on codominant, dominant for the rare allele, heterosis and recessive for the rare allele 



models of inheritance. 

a Value is adjusted by Bonferroni correction and statistically significant results (p<0.003). 




patients with sarcoidosis was significantly lower than that 
in the controls. These differences remained statistically sig- 
nificant after Bonferroni corrections. The rsl 049550 was 
the most significant of the three. A multi-SNP model 
revealed rsl 049550 to be the only significant independent 
SNP association with sarcoidosis risk. 

The multi-SNP model showed that only rsl 049550 pre- 
sented a significant effect on disease phenotype 
(p<0.001). No independent effect was found for 
rs2789679 or rs2819941 when adjusting for the effect of 
rsl 049550. Individual SNP and multi-SNP analysis sup- 
ported rsl 049550: T allele was an important protective 
factor for affecting sarcoidosis (OR around 0.6). In 
words, carriers of rsl 049550: the C allele has higher sus- 
ceptibility to sarcoidosis in our Chinese Han population 
(see online supplementary table S3) . 

We performed an association analysis to determine 
whether the haplotype was associated with the risk of sar- 
coidosis (Global p<0.05 in block 2 and block 3). 
Compared with controls, the T-C haplotype occurred 
significantly less frequently (p=0.001) but C-C haplo- 
types occurred more frequently (p=0.0001) in block 2 
(rsl049550-rs2573351) in patients with sarcoidosis. 

To assess particular disease phenotypes, patients with 
sarcoidosis were divided into subgroups according to 



their CXR stage. CXR stage I (isolated bilateral hilar 
lymphadenopathy) : N=196; CXR stages II-IV (infiltra- 
tion of parenchyma): 167; CXR stage 0 (absence of pul- 
monary disease): N=28; the information on CXR stage 
was not available for 21 patients. Genotype frequency 
distribution revealed that, in rsl 049550, the CC geno- 
type was significantly more in patients with CXR stage I 
sarcoidosis than in patients with CXR stages II-IV sar- 
coidosis (p=0.012; table 4). 

DISCUSSION 

A key step in linkage and association studies is to identify 
common risk variants in different populations. To deter- 
mine if common risk variants exist in distinct populations, 
15 SNPs which span approximately 0.53 Mb of the 
ANXA11 gene were genotyped in samples from patients 
with sarcoidosis and healthy controls in a Chinese Han 
population. Recently, new sarcoidosis loci have been identi- 
fied by GWASs. 4 14 With the fast development of GWAS, an 
increasing number of susceptibility loci for sarcoidosis have 
been found in different populations. 4 14 21 However, these 
observations should be confirmed in other genetically 
independent populations. In this study, we conducted the 
first large genetic association study of the ANXA1 1 gene in 



Table 2 ANXA1 1 haplotype in block 2 frequencies and the results of their associations with risk of sarcoidosis 


Haplotype 


Genecounting (frequency %) 




ID rsl 049550 rs2573351 


Cases 


Controls 


p Value* Global p valuet 


HAP1 T C 
HAP2 C C 
HAP3 C T 


27.7 
19.9 
50.5 


39.7 
7.4 
51.0 


0.001 0.003 

0.0001 

0.892 



*Based on 10 000 permutations. 

tBased on a comparison of frequency distribution of all haplotypes for the combination of SNPs. 
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Table 3 ANXA1 1 haplotype in block 3 frequencies and the results of their associations with risk of sarcoidosis 


Haplotype 




Genecounting (frequency %) 




ID rs2789695 


rs2573356 


Cases 


Controls 


p Value* Global p Valuet 


HAP1 C 


T 


33.7 


34.9 


0.718 0.045 


HAP2 T 


T 


15.3 


14.1 


0.632 


HAP2 T 


T 


15.6 


14.1 


0.565 



*Based on 10 000 permutations. 

tBased on a comparison of frequency distribution of all haplotypes for the combination of SNPs. 



a Chinese Han population. The evidence of markers asso- 
ciated with sarcoidosis was presented, and these markers 
were mapped to different locations in the ANXA11 gene 
(81897864-81951001). The association signals in the 
region were identified, and some significantly associated 
haplotypes also appeared in this region. 

Hofmann et at reported an association between the 
T allele of the non-synonymous SNP rs 1049550 and sig- 
nificantly decreased risk of sarcoidosis in a German 
population. Similar results were also obtained in other 
two European populations. 14 21 As a part of the sarcoid- 
osis GWAS done in Americans, 22 we further confirmed 
this association in a Chinese Han population. In this 
study, the frequency of the ANXA11 rsl049550 T allele 
was significantly lower in patients with sarcoidosis than 
in healthy controls. Genotype frequency distribution 
revealed that, in rs 1049550, the CC genotype was signifi- 
cantly more in patients with stage I sarcoidosis than in 
patients with stage II-IV sarcoidosis. The 'protective' 
effect of the ANXA11 T allele increased with the 
number of its copies in the genotype, which is consistent 
with the result obtained in patients with sarcoidosis in a 
German population 4 Furthermore, the T allele carriers 
among patients were protected from infiltration of lung 
parenchyma (radiographic stages II-IV). 21 We have also 
demonstrated that rsl 049550 is significant cross- 
ethnically at the gene level after adjustment for the 
single SNP association tests performed. The mechanism 
by which rsl 049550 affects the susceptibility to sarcoid- 
osis is that the SNP may affect the function of the 
ANXA11 protein rather than its expression. In humans, 
the ANXA11 protein consists of an N-terminal 
proline-tyrosine-glycine rich region, followed by four 
annexin core domains. The rsl 049550 leads to an 
amino-acid exchange (basic arginine to a polar cysteine) 
at the evolutionarily conserved position 230 (R230C) in 
the first annexin domain, which is responsible for the 
Ca 2+ -dependent trafficking of the protein in the cell. 23 



In the GWAS conducted by Hofmann et at, the stron- 
gest association signal was observed for rs2 789679. This 
marker is located in the 3'- (untranslated region) UTR of 
the ANXA11 gene. The T allele had a frequency of 35% 
among patients with sarcoidosis and 44% in healthy con- 
trols, with 13% of patients with sarcoidosis and 19% of 
controls being TT homozygous. Fine-mapping in the 
Black Women's Health Study (BWHS) revealed 
rs28 19941 to be the top SNP, associated with a non- 
significant (p=0.06) reduction in the risk of sarcoidosis 
(17% reduction per copy of the C-allele). In this case- 
control association study, the frequency of the A allele in 
rs2789679 and the rs2819941 T allele frequency in 
patients with sarcoidosis were significantly higher than 
they were in healthy controls. Several lines of evidence 
suggest that the observed association is unlikely to be an 
artefact. First, the single-SNP and the haplotype-based 
association analyses support the current finding. 
Second, our samples were from the same geographical 
region. Finally, consistent results were obtained from two 
genetically independent populations (Chinese Han and 
Europeans). Collectively, our results confirmed the 
strong association between ANXA1 1 polymorphisms and 
sarcoidosis, suggesting that ANXA11 represents a strong 
genetic risk factor for sarcoidosis. Since causal SNP in 
ANXA11 was not known, all the SNPs significantly asso- 
ciated with sarcoidosis were actually indirect associations 
through the real causal one. Though rs2789679 and 
rs2819941 were not in LD with rsl049550, it may not tag 
the causal SNP as well as rsl049550. Hence, their indi- 
vidual effects were eliminated when controlling for the 
effect of rsl049550. 

We further investigated the interaction among poly- 
morphisms and observed strong LD. The haplotype ana- 
lysis revealed that significantly more C-C (block 2, 
rsl049550-rs2573351) and significantly fewer T-C hap- 
lotypes (block 2) were found in patients with sarcoidosis. 
These results indicated that patients with C-C (block 2) 



Table 4 Chest X-ray stages of patients with sarcoidosis by genotype of rs2789679, rsl 049550 and rs2819941 





rs2789679 (n, %) 




rsl 049550* (n, %) 




rs2819941 (n, %) 




Stages 


AA AT 


TT 


CC CT 


TT 


CC CT 


TT 


Stage I 
Stages II-IV 
X 2 , p value 


59 (30.1) 92 (47.0) 
46 (27.6) 75 (44.9) 
1.041, 0.594 


45 (23.0) 

46 (27.6) 


101 (51.5) 82 (41.8) 
72 (43.1) 68 (40.7) 
8.807, 0.012 


13 (6.6) 
27 (16.2) 


47 (25.0) 88 (44.9) 
41 (24.6) 73 (43.7) 
0.052, 0.975 


61 (31.1) 
53 (31 .7) 



*p Values for genotype frequency distribution (p<0.05). 
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haplotypes of the ANXA1 1 gene were more prone to sar- 
coidosis. Significantly higher frequencies of T-C haplo- 
types were detected in healthy controls than in patients 
with sarcoidosis, suggesting that they may show protect- 
ive effects against sarcoidosis. Our sample size can 
detect SNP and haplotype associations with 90% and 
85% power, respectively, at a false positive rate of 5%, 
disease prevalence of l%o, disease allele/haplotype fre- 
quency of 0.05/0.03, and a presumed OR of 1.5. To 
some extent, this finding further supports a role of 
ANXA11 polymorphisms in sarcoidosis, while ethnic 
group difference may exist. 

In conclusion, our study suggests a potential role of 
ANXA11 SNPs and their related haplotypes in the 
genetic susceptibility to sarcoidosis. Further studies are 
needed to investigate how these SNPs affect the function 
of ANXA11. A broader examination of the genetic vari- 
ation in ANXA11 in the Han Chinese may reveal other 
variants associated with disease risk. 
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